Abstract

We derive an adiabatic theorem for Markov chains using well-known facts about
mixing and relaxation times. We discuss the results in the context of recent
developments in adiabatic quantum computation.

1 Introduction 

The adiabatic theorem was first stated by Max Born and Vladimir Fock in 1928. It
asserts that "a physical system remains in its instantaneous eigenstate if a given
perturbation is acting on it slowly enough and if there is a gap between the eigenvalue
and the rest of the Hamiltonian's spectrum" (see the Wikipedia page on the adiabatic
theorem).

Since under certain conditions Hamiltonians can be expressed via reversible Markov
chains, we state and prove the corresponding theorem for Markov chains. The task
turned out to be less complicated than initially expected, and we used only a small
fraction of the machinery of mixing times and relaxation times of Markov chains that
was developed so successfully in the last thirty years (see PJ, [2], [S], [7], [10], [13]
and references therein). Stated in terms of Markov chains, the adiabatic theorem is
intuitively simple and accessible (see [4] for comparison). Here we quote Nielsen and
Chuang [12]: "One of the goals of quantum computation and quantum information is to
develop tools which sharpen our intuition about quantum mechanics, and make its
predictions more transparent to human minds". It is important to point out that,
despite clear similarities and the relation between the two, the quantum adiabatic
theorem and the adiabatic theorem of this paper are different.

First we state a version of the adiabatic theorem. Consider two Hamiltonians,
$\mathcal{H}_{initial}$ and $\mathcal{H}_{final}$, acting on a quantum system, and let
$\mathcal{H}(s) = (1-s)\mathcal{H}_{initial} + s\mathcal{H}_{final}$. Suppose the system
evolves according to $\mathcal{H}(t/T)$ from time $t = 0$ to time $T$ (the so-called
adiabatic evolution). The adiabatic theorem of quantum mechanics states that for $T$
large enough, the final state of the system will be close to the ground state of
$\mathcal{H}_{final}$.
They are $\epsilon$-close in $\ell_2$ norm whenever $T \ge \frac{C}{\epsilon\beta^3}$, where $\beta$ is the least spectral gap of $\mathcal{H}(s)$

*Department of Mathematics, Oregon State University, Corvallis, OR 97331-4605, USA 
kovchegy@math.oregonstate.edu






Y.V.Kovchegov 



A note on adiabatic theorem for Markov chains 



over all $s \in [0, 1]$, and $C$ depends linearly on the square of the distance between
$\mathcal{H}_{initial}$ and $\mathcal{H}_{final}$. See [8] and [11]. There are many
versions of the adiabatic theorem.

Now, let us list the main concepts involved in the theory of mixing and relaxation
times that we will need in the following sections. We refer the readers to [10] and [3]
for details.

Definition 1. If $\mu$ and $\nu$ are two probability distributions over $\Omega$, then
the total variation distance is

$$\|\mu - \nu\|_{TV} = \frac{1}{2}\sum_{x \in \Omega} |\mu(x) - \nu(x)| = \sup_{A \subset \Omega} |\mu(A) - \nu(A)|.$$

Observe that the total variation distance measures the discrepancy between the
distributions on a scale from zero to one.
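As a small illustration of Definition 1, the following sketch computes the total variation distance for distributions on a finite set, given as lists of probabilities. The function names are hypothetical and chosen only for this example. The second function uses the standard fact that the supremum over events is attained at the event where the first distribution exceeds the second.

```python
def total_variation(mu, nu):
    """Half of the l1 distance between two distributions on the same finite set."""
    if len(mu) != len(nu):
        raise ValueError("distributions must live on the same state space")
    return 0.5 * sum(abs(m - n) for m, n in zip(mu, nu))


def total_variation_sup(mu, nu):
    """Same distance, computed as mu(A) - nu(A) on the event A = {x : mu(x) > nu(x)}."""
    return sum(m - n for m, n in zip(mu, nu) if m > n)


# Example: two distributions on a three-point space (illustrative values).
mu = [0.5, 0.5, 0.0]
nu = [0.25, 0.25, 0.5]
distance = total_variation(mu, nu)
```

Both functions return a value between zero (identical distributions) and one (distributions with disjoint supports).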

Definition 2. Suppose $P$ is an irreducible and aperiodic finite Markov chain with
stationary distribution $\pi$, i.e. $\pi P = \pi$. Given an $\epsilon > 0$, the mixing
time $t_{mix}(\epsilon)$ is defined as

$$t_{mix}(\epsilon) = \inf\left\{t : \|\nu P^t - \pi\|_{TV} \le \epsilon \text{ for all probability distributions } \nu\right\}.$$

Now, suppose $P = \big(p(x,y)\big)_{x,y \in \Omega}$ is reversible, i.e. $P$ satisfies
the detailed balance condition

$$\pi(x)p(x,y) = \pi(y)p(y,x) \quad \text{for all } x, y \text{ in the sample space } \Omega.$$

It is easy to check that if $P$ is reversible, then $P$ is self-adjoint with respect to
an inner product induced by $\pi$, and as such will have all real eigenvalues. The
difference $\beta = 1 - |\lambda_{(2)}|$ between the largest eigenvalue (i.e.
$\lambda_1 = 1$) and the second largest (in absolute value) eigenvalue $\lambda_{(2)}$
is called the spectral gap of $P$. The relaxation time is defined as

$$\tau_{rlx} = \frac{1}{\beta}.$$


One can show the following relationship between mixing and relaxation times. 

Theorem 1. Suppose $P$ is a reversible, irreducible and aperiodic Markov chain with
state space $\Omega$ and stationary distribution $\pi$. Then

$$(\tau_{rlx} - 1)\log\frac{1}{2\epsilon} \le t_{mix}(\epsilon) \le \tau_{rlx}\log\frac{1}{\epsilon \min_{x \in \Omega}\pi(x)}.$$
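The two-sided bound of Theorem 1 can be checked numerically on a small example. The sketch below is only an illustration under stated assumptions: it uses a two-state chain with hypothetical transition probabilities `a` and `b`, for which the eigenvalues are $1$ and $1 - a - b$, and it finds the mixing time by brute force. Since the total variation distance is convex in the starting distribution, it suffices to take the maximum over point masses.

```python
import math


def total_variation(mu, nu):
    return 0.5 * sum(abs(m - n) for m, n in zip(mu, nu))


def step(nu, P):
    """One step of the chain: row vector nu times matrix P."""
    n = len(P)
    return [sum(nu[i] * P[i][j] for i in range(n)) for j in range(n)]


def mixing_time(P, pi, eps, t_max=10_000):
    """Least t such that every point mass is within eps of pi after t steps."""
    n = len(P)
    rows = [[1.0 if i == x else 0.0 for i in range(n)] for x in range(n)]
    for t in range(t_max + 1):
        if max(total_variation(r, pi) for r in rows) <= eps:
            return t
        rows = [step(r, P) for r in rows]
    raise ValueError("chain did not mix within t_max steps")


# Illustrative two-state chain: 0 -> 1 with probability a, 1 -> 0 with probability b.
a, b, eps = 0.1, 0.2, 0.05
P = [[1.0 - a, a], [b, 1.0 - b]]
pi = [b / (a + b), a / (a + b)]          # stationary distribution
beta = 1.0 - abs(1.0 - a - b)            # spectral gap
tau_rlx = 1.0 / beta                     # relaxation time

t_mix = mixing_time(P, pi, eps)
lower = (tau_rlx - 1.0) * math.log(1.0 / (2.0 * eps))
upper = tau_rlx * math.log(1.0 / (eps * min(pi)))
```

For these values the brute-force mixing time lies between the lower and upper bounds of Theorem 1.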



To conclude the introduction, we would like to point out that the mixing and
relaxation time techniques have been widely used in the theory of computation and
algorithms (see [13]). The current paper was influenced by the work of D. Aharonov
et al. [1]. The model of adiabatic quantum computation is based on the quantum
adiabatic theorem. The connection between adiabatic quantum computation and
relaxation times of reversible Markov chains was used in [1] to prove an equivalence
between adiabatic quantum computation and quantum computation. See [9] and [12] for
more on quantum computation.
2 Hamiltonians, Markov chains, and adiabatic quantum computation

Let the Hamiltonian (or energy) $\mathcal{H}$ be a Hermitian real matrix. If it is also
true that $I - \mathcal{H}$ is nonnegative, irreducible and aperiodic (i.e.
$(I - \mathcal{H})^k > 0$ for $k$ large enough), then by Perron's theorem, its largest
(in absolute value) eigenvalue is positive with a unique eigenvector with all positive
coordinates.

Let $I - \mathcal{H} = [a_{i,j}]$ be an irreducible and aperiodic nonnegative
$(N+1) \times (N+1)$ dimensional Hamiltonian matrix, with eigenvalues

$$1 - \lambda_0 > 1 - \lambda_1 \ge 1 - \lambda_2 \ge \cdots \ge 1 - \lambda_N,$$

where $\lambda_j$ are eigenvalues of the Hamiltonian $\mathcal{H}$, the lowest of them
is the ground energy $\lambda_0$, and by Perron's theorem,
$1 - \lambda_0 > |1 - \lambda_j|$ ($j = 1, \ldots, N$). Then

$$P = \left[\frac{\alpha_j\, a_{i,j}}{(1 - \lambda_0)\,\alpha_i}\right],$$

where the ground state $\alpha = (\alpha_0, \ldots, \alpha_N)$ with all $\alpha_j > 0$
is the unique eigenvector of $\mathcal{H}$ corresponding to $\lambda_0$, will be an
irreducible aperiodic reversible Markov chain over $(N+1)$ states, with eigenvalues

$$r_0 = 1 > r_1 \ge r_2 \ge \cdots \ge r_N,$$

where $r_j = \frac{1 - \lambda_j}{1 - \lambda_0}$, and stationary distribution

$$\pi = \frac{1}{\|\alpha\|_{\ell_2}^2}\left(\alpha_0^2, \ldots, \alpha_N^2\right).$$



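Thus if $I - \mathcal{H}$ is nonnegative, irreducible and aperiodic, we can convert the
Hamiltonian into a reversible transition probability matrix. Observe that the spectral
gap $\beta = 1 - \max\{|r_1|, |r_N|\}$ of $P$ will be a multiple of the spectral gap
$\beta_{\mathcal{H}} = \lambda_1 - \lambda_0$ of $\mathcal{H}$,

$$\beta = \frac{\beta_{\mathcal{H}}}{1 - \lambda_0},$$

whenever $r_1 \ge |r_N|$. If we consider a lazy Markov chain
$\tilde{P} = \frac{1}{2}(I + P)$, then the eigenvectors of $\tilde{P}$ will coincide
with the eigenvectors of $P$, and its eigenvalues will be

$$\tilde{r}_0 = 1 > \tilde{r}_1 \ge \tilde{r}_2 \ge \cdots \ge \tilde{r}_N \ge 0,$$

where $\tilde{r}_j = \frac{1 + r_j}{2}$. The lazy Markov chain $\tilde{P}$ evolves at
half the speed of $P$, and its spectral gap is

$$\tilde{\beta} = 1 - \max\{|\tilde{r}_1|, |\tilde{r}_N|\} = 1 - \tilde{r}_1 = \frac{\beta_{\mathcal{H}}}{2(1 - \lambda_0)}.$$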
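Observe that

$$P = \frac{1}{1 - \lambda_0}\left(I - A^{-1}\mathcal{H}A\right),$$

where

$$A = \begin{pmatrix} \alpha_0 & 0 & \cdots & 0 \\ 0 & \alpha_1 & \ddots & \vdots \\ \vdots & \ddots & \ddots & 0 \\ 0 & \cdots & 0 & \alpha_N \end{pmatrix}.$$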
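The conversion from a Hamiltonian to a reversible Markov chain described above can be sketched numerically. The code below is a minimal illustration, not the author's implementation: the $3 \times 3$ matrix `H` is a made-up example chosen so that $I - \mathcal{H}$ is nonnegative, irreducible and aperiodic, the function names are hypothetical, and the Perron eigenpair is approximated by plain power iteration.

```python
import math


def mat_vec(M, v):
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]


def perron_pair(M, iters=2000):
    """Power iteration: dominant eigenvalue and unit eigenvector of a primitive matrix."""
    v = [1.0] * len(M)
    for _ in range(iters):
        w = mat_vec(M, v)
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    w = mat_vec(M, v)
    eigenvalue = sum(wi * vi for wi, vi in zip(w, v))  # Rayleigh quotient, |v| = 1
    return eigenvalue, v


def hamiltonian_to_chain(H):
    """Return (P, pi, ground_energy) for a real symmetric H with I - H primitive."""
    n = len(H)
    M = [[(1.0 if i == j else 0.0) - H[i][j] for j in range(n)] for i in range(n)]
    top, alpha = perron_pair(M)                      # top = 1 - lambda_0
    P = [[alpha[j] * M[i][j] / (top * alpha[i]) for j in range(n)] for i in range(n)]
    norm_sq = sum(a * a for a in alpha)
    pi = [a * a / norm_sq for a in alpha]            # squared ground state amplitudes
    return P, pi, 1.0 - top


# Made-up example Hamiltonian (an assumption for illustration only).
H = [[0.5, -0.2, 0.0],
     [-0.2, 0.4, -0.3],
     [0.0, -0.3, 0.6]]
P, pi, ground_energy = hamiltonian_to_chain(H)

# Lazy version of the chain, (I + P) / 2.
lazy = [[0.5 * ((1.0 if i == j else 0.0) + P[i][j]) for j in range(3)] for i in range(3)]
```

The rows of `P` sum to one because the ground state is an eigenvector of $I - \mathcal{H}$, and detailed balance with respect to `pi` follows from the symmetry of $\mathcal{H}$.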

In quantum computation, if we operate on $n$ qubits (quantum bits), the Hamiltonian
will be a $2^n \times 2^n$ matrix. In this case, if the conditions are satisfied and we
can convert the Hamiltonian into a Markov chain, the stationary distribution $\pi$ will
represent the quantum probabilities for the multiple qubits
$|0 \ldots 00\rangle, |0 \ldots 01\rangle, \ldots, |1 \ldots 11\rangle$.
We can also consider a restriction of the Hamiltonian to an invariant subspace that
has the needed properties.



In the model of adiabatic quantum computation, an algorithm is described by the
adiabatic evolution

$$\mathcal{H}(s) = (1 - s)\mathcal{H}_{initial} + s\mathcal{H}_{final},$$

where the input is the ground state (the eigenvector corresponding to the lowest
eigenvalue) of $\mathcal{H}_{initial}$ and the desired output is the ground state of
$\mathcal{H}_{final}$. Thus, according to the quantum adiabatic theorem, if the duration
time $T$ is large enough, then the output will be $\epsilon$-close to the ground state
of $\mathcal{H}_{final}$. The duration time ($T \max_s \|\mathcal{H}(s)\|$) required will
be the running time for the algorithm (according to some definitions). Therefore
finding the optimal $T_\epsilon$ would optimize the algorithm's running time. In [1],
D. Aharonov et al. use Markov chain techniques to estimate the spectral gap for the
Markov chain $P_{final}$ generated from $\mathcal{H}_{final}$, thereby obtaining an
estimate for the spectral gap of $\mathcal{H}(s)$ when $s$ is close enough to one, and
subsequently use the quantum adiabatic theorem to bound the running time.



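Now, suppose $\mathcal{H}$ is a Hermitian real matrix that satisfies the required
conditions, i.e. $(I - \mathcal{H})^k > 0$ for $k$ large enough. Then
$(1 - \lambda_0)P = I - A^{-1}\mathcal{H}A$ and

$$\mathcal{H} = I - (1 - \lambda_0)D^{-\frac{1}{2}}\hat{P}D^{\frac{1}{2}},$$

where

$$D = \begin{pmatrix} \pi_0 & 0 & \cdots & 0 \\ 0 & \pi_1 & \ddots & \vdots \\ \vdots & \ddots & \ddots & 0 \\ 0 & \cdots & 0 & \pi_N \end{pmatrix} = \frac{1}{\|\alpha\|_{\ell_2}^2}A^2 \quad \text{and} \quad \hat{P} = DPD^{-1}.$$

Here, since $\pi = (\pi_0, \ldots, \pi_N)$ is the stationary distribution of reversible
$P$, the matrix $\hat{P}$ will be a self-adjoint Markov chain with the same spectrum as
$P$.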



In the above discussion we summarized a connection between the Hamiltonians and 
Markov chains in the context of quantum information. 
3 Adiabatic theorem for Markov chains

Given two transition probability operators, $P_{initial}$ and $P_{final}$, with a finite
state space $\Omega$, suppose $P_{final}$ is irreducible and aperiodic and $\pi_f$ is the
unique stationary distribution for $P_{final}$. Let

$$P_s = (1 - s)P_{initial} + sP_{final}.$$

Definition 3. Given $\epsilon > 0$, a time $T_\epsilon$ is called the adiabatic time if
it is the least $T$ such that

$$\max_\nu \|\nu P_{\frac{1}{T}}P_{\frac{2}{T}} \cdots P_{\frac{T-1}{T}}P_1 - \pi_f\|_{TV} \le \epsilon,$$

where the maximum is taken over all probability distributions $\nu$ over $\Omega$.
Theorem 2. Let $t_{mix}$ denote the mixing time for $P_{final}$. Then the adiabatic time

$$T_\epsilon = O\left(\frac{t_{mix}(\epsilon/2)^2}{\epsilon}\right).$$

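Proof. Observe that

$$\nu P_{\frac{1}{T}}P_{\frac{2}{T}} \cdots P_{\frac{T-1}{T}}P_1 = \frac{T!}{N!\,T^{T-N}}\,\nu_N P_{final}^{T-N} + \mathcal{E},$$

where $\nu_N = \nu P_{\frac{1}{T}}P_{\frac{2}{T}} \cdots P_{\frac{N}{T}}$ and $\mathcal{E}$
is the rest of the terms. Hence, by the triangle inequality,

$$\max_\nu \|\nu P_{\frac{1}{T}}P_{\frac{2}{T}} \cdots P_{\frac{T-1}{T}}P_1 - \pi_f\|_{TV} \le \max_\nu \|\nu P_{final}^{T-N} - \pi_f\|_{TV} \cdot \frac{T!}{N!\,T^{T-N}} + S_N,$$

where $0 \le S_N \le 1 - \frac{T!}{N!\,T^{T-N}}$.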

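Let $T = Kt_{mix}(\epsilon/2)$ and $N = (K-1)t_{mix}(\epsilon/2)$, so that
$\max_\nu \|\nu P_{final}^{T-N} - \pi_f\|_{TV} \le \epsilon/2$. Observe that

$$e^{\int_N^T \log x\,dx - (T-N)\log T} \le \frac{T!}{N!\,T^{T-N}} = e^{\sum_{j=N+1}^{T}\log j - (T-N)\log T} \le 1,$$

and therefore, expressing $T$ and $N$ via $t_{mix}(\epsilon/2)$, and simplifying, we obtain

$$\left(\frac{1}{e}\left(\frac{K}{K-1}\right)^{K-1}\right)^{t_{mix}(\epsilon/2)} = e^{N\log\frac{T}{N} - (T-N)} \le \frac{T!}{N!\,T^{T-N}} \le 1.$$

So

$$S_N \le 1 - \left(\frac{1}{e}\left(\frac{K}{K-1}\right)^{K-1}\right)^{t_{mix}(\epsilon/2)},$$

and we need to find the least $K$ such that

$$1 - \left(\frac{1}{e}\left(\frac{K}{K-1}\right)^{K-1}\right)^{t_{mix}(\epsilon/2)} \le \epsilon/2.$$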
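Now, since $\log(1 + x) = x - \frac{x^2}{2} + O(x^3)$, the least such $K$ is approximated
as follows

$$K \approx \frac{t_{mix}(\epsilon/2)}{-2\log(1 - \epsilon/2)} \approx \frac{t_{mix}(\epsilon/2)}{\epsilon}.$$

Thus for $T = Kt_{mix}(\epsilon/2) \approx \frac{t_{mix}(\epsilon/2)^2}{\epsilon}$,

$$\max_\nu \|\nu P_{\frac{1}{T}}P_{\frac{2}{T}} \cdots P_{\frac{T-1}{T}}P_1 - \pi_f\|_{TV} \le \epsilon.$$

□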

Observe that Theorem 2 is independent of the distance. We can use the $\ell_2$ norm in
the definitions of adiabatic and mixing times, and arrive at the same result.
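The adiabatic time of Definition 3 can be found by brute force for small chains. The sketch below is an illustration under stated assumptions: the two transition matrices are made-up examples, the function names are hypothetical, and the search simply tries $T = 1, 2, \ldots$ in order. The maximum over starting distributions is taken over point masses, which suffices because the total variation distance is convex in the starting distribution.

```python
def total_variation(mu, nu):
    return 0.5 * sum(abs(m - n) for m, n in zip(mu, nu))


def step(nu, P):
    """Row vector nu times matrix P."""
    n = len(P)
    return [sum(nu[i] * P[i][j] for i in range(n)) for j in range(n)]


def interpolate(P0, P1, s):
    """P_s = (1 - s) P0 + s P1."""
    n = len(P0)
    return [[(1.0 - s) * P0[i][j] + s * P1[i][j] for j in range(n)] for i in range(n)]


def adiabatic_error(P_init, P_final, pi_f, T):
    """Worst-case distance to pi_f after applying P_{1/T}, P_{2/T}, ..., P_1."""
    n = len(pi_f)
    worst = 0.0
    for x in range(n):
        nu = [1.0 if i == x else 0.0 for i in range(n)]
        for k in range(1, T + 1):
            nu = step(nu, interpolate(P_init, P_final, k / T))
        worst = max(worst, total_variation(nu, pi_f))
    return worst


def adiabatic_time(P_init, P_final, pi_f, eps, T_max=5000):
    """Least T (up to T_max) whose worst-case error is at most eps."""
    for T in range(1, T_max + 1):
        if adiabatic_error(P_init, P_final, pi_f, T) <= eps:
            return T
    raise ValueError("no adiabatic time found up to T_max")


# Made-up example chains (assumptions for illustration only).
P_init = [[0.9, 0.1], [0.1, 0.9]]
P_final = [[0.9, 0.1], [0.2, 0.8]]
pi_f = [2.0 / 3.0, 1.0 / 3.0]      # stationary distribution of P_final
eps = 0.05
T_eps = adiabatic_time(P_init, P_final, pi_f, eps)
```

Comparing `T_eps` with $t_{mix}(\epsilon/2)^2/\epsilon$ for the same `P_final` gives a feel for how conservative the bound of Theorem 2 is on a particular example.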

While a version of the quantum adiabatic theorem can be proved without a gap
condition (see [S]), the Markov chain $P_{final}$ over a finite sample space $\Omega$,
if it is reversible, will have a spectral gap. Here we apply Theorem 1 to the result in
Theorem 2.

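Corollary. Suppose $P_{initial}$ and $P_{final}$ are Markov chains with state space
$\Omega$. If $P_{final}$ is reversible, irreducible and aperiodic with its spectral gap
$\beta > 0$, then

$$T_\epsilon = O\left(\frac{\left(\log\frac{2}{\epsilon} + \log\frac{1}{\min_{x \in \Omega}\pi_f(x)}\right)^2}{\epsilon\beta^2}\right).$$

Indeed, Theorem 1 gives
$t_{mix}(\epsilon/2) \le \frac{1}{\beta}\left(\log\frac{2}{\epsilon} + \log\frac{1}{\min_{x}\pi_f(x)}\right)$,
and substituting this into Theorem 2 yields the bound.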



4 Continuous time Markov processes 

In the case of continuous time Markov chains, an equivalent result is produced via
the method of uniformization and order statistics. Suppose $Q$ is a bounded Markov
generator for a continuous time Markov chain $P(t)$, and
$\lambda \ge \max_{i \in \Omega}\sum_{j: j \ne i} q(i,j)$ is an upper bound on the
departure rates over all states. The method of uniformization provides an expression
for the transition probabilities $P(t)$ as follows:

$$P(t) = \sum_{n=0}^{\infty} \frac{(\lambda t)^n}{n!}e^{-\lambda t}P_\lambda^n, \quad \text{where } P_\lambda = I + \frac{1}{\lambda}Q.$$

n=0 

The expression is obtained via conditioning on the number of arrivals in a Poisson
process with rate $\lambda$.
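The uniformization formula can be checked against a case with a known closed form. The sketch below is an illustration under stated assumptions: it uses a two-state generator with hypothetical rates `a` and `b`, truncates the Poisson series after a fixed number of terms, and compares one entry with the standard two-state formula $p_{00}(t) = \frac{b}{a+b} + \frac{a}{a+b}e^{-(a+b)t}$.

```python
import math


def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def uniformized_transition(Q, t, lam, terms=200):
    """Truncated series sum_n e^{-lam t} (lam t)^n / n! * P_lam^n with P_lam = I + Q / lam."""
    n = len(Q)
    P_lam = [[(1.0 if i == j else 0.0) + Q[i][j] / lam for j in range(n)] for i in range(n)]
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # P_lam^0
    weight = math.exp(-lam * t)                                              # Poisson weight, n = 0
    result = [[0.0] * n for _ in range(n)]
    for k in range(terms):
        for i in range(n):
            for j in range(n):
                result[i][j] += weight * power[i][j]
        power = mat_mul(power, P_lam)
        weight *= lam * t / (k + 1)
    return result


# Two-state generator with illustrative rates: 0 -> 1 at rate a, 1 -> 0 at rate b.
a, b, t = 1.0, 2.0, 0.7
Q = [[-a, a], [b, -b]]
lam = 2.0                                  # at least the largest departure rate
P_t = uniformized_transition(Q, t, lam)
expected_p00 = b / (a + b) + a / (a + b) * math.exp(-(a + b) * t)
```

Any `lam` at least as large as the maximal departure rate gives the same `P_t`; a larger value only spreads the Poisson weights over more terms.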

The definition of mixing time is similar in the case of continuous time processes.

Definition 4. Suppose $P(t)$ is an irreducible and finite continuous time Markov chain
with stationary distribution $\pi$. Given an $\epsilon > 0$, the mixing time
$t_{mix}(\epsilon)$ is defined as

$$t_{mix}(\epsilon) = \inf\left\{t : \|\nu P(t) - \pi\|_{TV} \le \epsilon \text{ for all probability distributions } \nu\right\}.$$

Suppose $Q_{initial}$ and $Q_{final}$ are two bounded generators for continuous time
Markov processes over a finite state space $\Omega$, and $\pi_f$ is the only stationary
distribution for $Q_{final}$. Let

$$Q[s] = (1 - s)Q_{initial} + sQ_{final}$$

be a time non-homogeneous generator. Given $T > 0$, let $P_T(t_1, t_2)$
($0 \le t_1 \le t_2 \le T$) denote a matrix of transition probabilities of a Markov
process generated by $Q\left[\frac{t}{T}\right]$ over the $[t_1, t_2]$ time interval.



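Observe that the continuous time Markov adiabatic evolution is governed by

$$\frac{d\nu_t}{dt} = \nu_t Q\left[\frac{t}{T}\right], \quad t \in [0, T],$$

while the quantum adiabatic evolution is described via the corresponding Schrödinger
equation $\frac{d\nu_t}{dt} = -i\nu_t \mathcal{H}\left(\frac{t}{T}\right)$.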
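The continuous time adiabatic evolution can be integrated numerically to see how close the final distribution is to $\pi_f$. The sketch below is only an illustration under stated assumptions: the two generators are made-up examples, the function names are hypothetical, and a classical fourth-order Runge-Kutta scheme with a fixed step is swapped in as the integrator.

```python
def total_variation(mu, nu):
    return 0.5 * sum(abs(m - n) for m, n in zip(mu, nu))


def generator(Q_init, Q_final, s):
    """Q[s] = (1 - s) Q_init + s Q_final."""
    n = len(Q_init)
    return [[(1.0 - s) * Q_init[i][j] + s * Q_final[i][j] for j in range(n)] for i in range(n)]


def vec_mat(nu, Q):
    n = len(Q)
    return [sum(nu[i] * Q[i][j] for i in range(n)) for j in range(n)]


def evolve(nu0, Q_init, Q_final, T, steps_per_unit=100):
    """Integrate d nu / dt = nu Q[t / T] on [0, T] with fixed-step RK4."""
    steps = int(T * steps_per_unit)
    h = T / steps

    def rhs(t, v):
        return vec_mat(v, generator(Q_init, Q_final, t / T))

    nu, t = list(nu0), 0.0
    for _ in range(steps):
        k1 = rhs(t, nu)
        k2 = rhs(t + h / 2, [v + h / 2 * k for v, k in zip(nu, k1)])
        k3 = rhs(t + h / 2, [v + h / 2 * k for v, k in zip(nu, k2)])
        k4 = rhs(t + h, [v + h * k for v, k in zip(nu, k3)])
        nu = [v + h / 6 * (a + 2 * b + 2 * c + d)
              for v, a, b, c, d in zip(nu, k1, k2, k3, k4)]
        t += h
    return nu


# Made-up example generators (assumptions for illustration only).
Q_init = [[-1.0, 1.0], [1.0, -1.0]]
Q_final = [[-1.0, 1.0], [2.0, -2.0]]
pi_f = [2.0 / 3.0, 1.0 / 3.0]            # stationary distribution of Q_final
final = evolve([1.0, 0.0], Q_init, Q_final, T=50.0)
error = total_variation(final, pi_f)
```

Repeating the computation for several values of `T` shows the error shrinking as the evolution is slowed down, which is the behaviour the adiabatic time quantifies.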

Definition 5. Given $\epsilon > 0$, a time $T_\epsilon$ is called the adiabatic time if
it is the least $T$ such that

$$\max_\nu \|\nu P_T(0, T) - \pi_f\|_{TV} \le \epsilon,$$

where the maximum is taken over all probability distributions $\nu$ over $\Omega$.
We will state and prove the following adiabatic theorem.

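Theorem 3. Let $t_{mix}$ denote the mixing time for $Q_{final}$. Take $\lambda$ such that

$$\lambda \ge \max_{i \in \Omega}\sum_{j: j \ne i} q_{initial}(i,j), \quad \lambda \ge \max_{i \in \Omega}\sum_{j: j \ne i} q_{final}(i,j) \quad \text{and} \quad \lambda \ge \frac{\epsilon}{2t_{mix}(\epsilon/2)} + 1,$$

where $q_{initial}(i,j)$ and $q_{final}(i,j)$ are the rates in $Q_{initial}$ and
$Q_{final}$ respectively. Then the adiabatic time

$$T_\epsilon \le \frac{\lambda\,t_{mix}(\epsilon/2)^2}{\epsilon}.$$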

Proof. Observe that $\lambda \ge \max_{i \in \Omega}\sum_{j: j \ne i} q_t(i,j)$, where
$q_t(i,j)$ are the rates in $Q\left[\frac{t}{T}\right]$ ($0 \le t \le T$).

Let $T = Kt_{mix}(\epsilon/2)$ and $N = (K-1)t_{mix}(\epsilon/2)$, so that

$$\max_\nu \|\nu P_{final}(T - N) - \pi_f\|_{TV} \le \epsilon/2,$$

where $P_{final}(t) = e^{tQ_{final}}$ denotes the transition probability matrix associated
with the generator $Q_{final}$.

Now, let $P_0 = I + \frac{1}{\lambda}Q_{initial}$ and $P_1 = I + \frac{1}{\lambda}Q_{final}$.
Then $P_0$ and $P_1$ are discrete Markov chains and

$$\nu P_T(0, T) = \nu_N P_T(N, T) = \nu_N\left(\sum_{n=0}^{\infty}\frac{(\lambda(T-N))^n}{n!}e^{-\lambda(T-N)}I_n\right),$$

where $\nu_N = \nu P_T(0, N)$ and

$$I_n = \frac{n!}{(T-N)^n}\int\cdots\int_{N < x_1 < x_2 < \cdots < x_n < T}\left[\left(1 - \tfrac{x_1}{T}\right)P_0 + \tfrac{x_1}{T}P_1\right]\cdots\left[\left(1 - \tfrac{x_n}{T}\right)P_0 + \tfrac{x_n}{T}P_1\right]dx_1 \ldots dx_n,$$

i.e. the order statistics of $n$ arrivals within the $[N, T]$ time interval. We used the
fact that, when conditioned on the number of arrivals, the arrival times of a Poisson
process are distributed as the order statistics of uniform random variables. Hence



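$$\begin{aligned}
\nu P_T(0, T) &= \nu_N\left(\sum_{n=0}^{\infty}\frac{e^{-\lambda(T-N)}(\lambda(T-N))^n}{(T-N)^n\,T^n\,n!}\,P_1^n\int_N^T\cdots\int_N^T x_1 \cdots x_n\,dx_1 \ldots dx_n\right) + \mathcal{E} \\
&= e^{-\lambda(T-N)}\,\nu_N\left(\sum_{n=0}^{\infty}\frac{\lambda^n}{n!}\left(\frac{T^2 - N^2}{2T}\right)^n P_1^n\right) + \mathcal{E} \\
&= e^{-\lambda t_{mix}(\epsilon/2)}\,\nu_N\left(\sum_{n=0}^{\infty}\frac{\lambda^n\left[\left(1 - \frac{1}{2K}\right)t_{mix}(\epsilon/2)\right]^n}{n!}P_1^n\right) + \mathcal{E} \\
&= e^{-\lambda t_{mix}(\epsilon/2)}\,\nu_N P_{final}\left(\lambda\left(1 - \tfrac{1}{2K}\right)t_{mix}(\epsilon/2)\right) + \mathcal{E},
\end{aligned}$$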



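where $\mathcal{E}$ is the rest of the terms. Thus, the total variation distance,

$$\max_\nu \|\nu P_T(0, T) - \pi_f\|_{TV} \le e^{-\lambda t_{mix}(\epsilon/2)}\,\epsilon/2 + S_N$$

whenever $\lambda\left(1 - \frac{1}{2K}\right) \ge 1$, i.e. $K \ge \frac{\lambda}{2(\lambda - 1)}$.
Taking $K \ge \frac{\lambda t_{mix}(\epsilon/2)}{\epsilon}$, we bound the error term

$$S_N \le 1 - e^{-\lambda t_{mix}(\epsilon/2)}\sum_{n=0}^{\infty}\frac{\lambda^n\left[\left(1 - \frac{1}{2K}\right)t_{mix}(\epsilon/2)\right]^n}{n!} = 1 - e^{-\frac{\lambda t_{mix}(\epsilon/2)}{2K}} \le \epsilon/2$$

as

$$\epsilon \le -2\log\left(1 - \frac{\epsilon}{2}\right).$$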



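The condition $K \ge \frac{\lambda}{2(\lambda - 1)}$ is met since we have taken
$\lambda \ge \frac{\epsilon}{2t_{mix}(\epsilon/2)} + 1$ and
$K \ge \frac{\lambda t_{mix}(\epsilon/2)}{\epsilon}$. Therefore

$$T_\epsilon \le \frac{\lambda\,t_{mix}(\epsilon/2)^2}{\epsilon}.$$

□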

In the end, we would like to mention a possible application. The Glauber dynamics 
of a finite Ising model is a continuous time Markov process. Its mixing time can be 
estimated using path coupling. The adiabatic transformation of the Markov generator 
corresponds to an adiabatic transformation of the Hamiltonian. The above result can 
be applied to obtain the adiabatic time for the transformation. The result can be 
adjusted when the adiabatic evolution of the generator is nonlinear. 